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Abstract 

We describe how to create ab-initio effective Hamiltonians that qualitatively describe cor¬ 
rect chemistry even when used with a minimal basis. The Hamiltonians are obtained by fold¬ 
ing correlation down from a large parent basis into a small, or minimal, target basis, using the 
machinery of canonical transformations. We demonstrate the quality of these effective Hamil¬ 
tonians to correctly capture a wide range of excited states in water, nitrogen, and ethylene, and 
to describe ground and excited state bond-breaking in nitrogen and the chromium dimer, all in 
small or minimal basis sets. 


1 Introduction 

The rapid evolution of quantum chemistry over the last decades means that in many molecules 14 
and even in some condensed phase systems, 5 the combination of many-electron correlation meth¬ 
ods with large basis sets provides predictions to beyond chemical accuracy of 1 kJ/mol. Despite 
these numerical advances, qualitative chemical and physical reasoning necessarily remains rooted 
in simple concepts. 

*To whom correspondence should be addressed 
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One way to connect quantitative calculations to qualitative understanding is to construct an 
effective Hamiltonian to describe the correct correlated behaviour in terms of only the minimal 
chemical degrees of freedom, i.e. a minimal basis. Semi-empirical methods define such Hamil¬ 
tonians by fitting to precomputed observables, but relying on empirical parametrization removes 
many advantages of predictive computation. A more satisfying route to effective minimal basis 
Hamiltonians is via rigorous ab-initio many-body theory. In this work, we will construct minimal 
basis effective Hamiltonians by rigorous many-body canonical transformations. While in principle 
an exact procedure, in practice approximations are necessary. We will thus be primarily concerned 
with addressing two questions of approximation. First, how do we define a simple, cheap, and 
stable, approximate canonical transformation to obtain the effective minimal basis Hamiltonian? 
And second, how well do such minimal basis Hamiltonians capture non-trivial chemistry, at least 
at a qualitative level? 

We must use an effective as opposed to a bare Hamiltonian in a minimal basis because it is 
well established that quantum chemistry with the bare minimal basis Hamiltonian is exceedingly 
poor. This is because the electrons in filled orbitals cannot avoid each other, and the Coulomb 
interaction is felt too strongly. Modifying the Coulomb interaction to take into account excursions 
of electrons into orbitals external to the minimal basis is referred to as folding in the (effects of the) 
external orbitals. Alternatively, since the effective Coulomb interaction is decreased in magnitude, 
this process is often referred to as screening. 

The effective Hamiltonian of interest depends in part on the choice of many-body formalism. 
For example, in Green’s function approaches, we can define an effective two-particle (four-point) 
interaction operator T(1234) (where the labels 1,2,3,4 include both time and orbital indices) that 
yields the appropriate two-particle (four-point) Green’s function AT( 1234) in the minimal basis, 
using a Dyson-like equation 6 


K = (GG) + (GG)r(GG) 


( 1 ) 
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where G is the single-particle Green’s function. The interaction operator T depends on time. When 
limited to the particle-hole channel, it is called the screened interaction, and is commonly computed 
within the random-phase approximation. 7 8 

Here, however, we are concerned with the effective Hamiltonian H to be used in the many-body 
Schrodinger equation in the minimal basis. 


— Ei\<t>i) ( 2 ) 

The state |d>,} exists only in the Hilbert space of the minimal basis, but is related to the exact 
eigenstate | V P,) in the full space by a many-body canonical transformation Tf) = £/|<f>,), thus 
H = U T HU. Unlike the interaction operator T, the effective Hamiltonian H is time-independent, 
but in general contains three-body and higher body terms. The Hilbert space that |<f>,) lives in can 
be thought of as spanned by a basis of quasi-particles associated with second-quantized operators 
fl(t) = £/t fl (t)£/. 

The basic formalism of (canonically transformed) effective Hamiltonians is very old and well- 
known. There are two families of approximation methods. The first focuses on the effective Hamil¬ 
tonian itself and dates all the way back to Van Vleck 9 and other early workers such as Brandow, 10 
Wcsthaus, 11 Freed,S 2 13 and others. 14 Also in the first family are the renormalization group ap¬ 
proaches to H, based on successive iterative approximations, as developed by Wegner, 15 Glazek 
and Wilson, 16 and White. 17 The second family of methods focuses more on the eigenstates | V F ; ) 
and their associated wave operator. This includes the many variants of coupled cluster theory, 18 
and especially the equation-of-motion 19 and multireference extensions. 20 There is much overlap 
between the families and there are methods which belong to both (such as the earlier canoni¬ 
cal transformation work of Yanai, Neuscamman, and Chan, 21 24 the anti-Hermitian contracted 
Schrodinger equation of MazziottiJ 25 26 and the recent similarity renormalization group work of 
Evangelista 27 ). However, a defining difference is that in the presence of degeneracies and strong 
correlations, the first family usually adopts a “perturb and diagonalize” strategy, while the second 
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performs “diagonalize and perturb”. This difference is one of philosophy but can lead to different 
choices in approximations. 

We are concerned here with techniques in the first family to construct effective minimal ba¬ 
sis Hamiltonians. The earlier works by Freed, and by White, and especially the recent work by 
Yanai and Shiozaki 28 are, in our view, conceptually the most closely related. We will explore the 
approach of Yanai and Shiozaki in parallel to the new approaches in this work. In addition, the nu¬ 
merical approximations we use build on the earlier work on approximate canonical transformations 
by Yanai, Neuscamman, and Chan. 

In Section [2] we begin by precisely defining the effective Hamiltonian in a minimal basis. We 
then outline the earlier approximate canonical transformation formalism of Yanai, Neuscamman, 
and Chan, and next the detailed steps and approximations to construct the effective minimal basis 
Hamiltonians in this work. We proceed to assess the performance of our effective Hamiltonians for 
a variety of chemical phenomena, including electronic excited states and potential energy surfaces. 
We finish with a discussion of some future directions of this approach. 

2 Theory 

2.1 The effective Hamiltonian 

We begin with some notation. The effective Hamiltonian folds the effects of electron correlation 
from an initial large (possibly infinite) “parent” basis down to a smaller “target” basis. We label 
orbitals in the parent basis (assumed orthogonalized) by indices (v, A, /l, v}. Thus the parent basis 
Hamiltonian is written as 


H — ^jhlivEfiv T . ^ VjivKxEjivX k (3) 

jUV 4 /ivjcA 

where we use the spin-summed excitation operators v = E<re{a,/3} a lio a v< j, and V jlVK x represents 
the anti-symmetrized two-electron integral operator. 
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We label the target basis (assumed orthogonalized) by { p.q. r.s}. In this work, we take the 
target basis to be spanned by a set of minimal Gaussian basis functions, although in principle 
any small basis can be used. The (orthogonal) functions that are in the parent basis but which 
live outside of the target basis , define the external space; we will label these by {x,y}. When 
the parent basis is formally infinite (as in R12/F12 theory), the external space is represented, as 
necessary, by its complementary auxiliary basis 29 (discussed in detail in Section 2.4). Note that 


for two arbitrary finite parent and target Gaussian bases, the target basis is not usually a subspace 
of the larger parent basis. In that case, we consider the parent basis to be the union of the target 
basis and the original parent basis. For example, if using an ANO-RCC-MIN target basis 30 and an 
aug-cc-pVQZ 31 parent basis, we take the parent basis in numerical calculations to be ANO-RCC- 
MIN+aug-cc-PVQZ. The dimension of the external space is then the same as that of the original 
parent basis. 

The exact effective Hamiltonian in the target basis is 


1 
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H — hpqEpq + ^VpqrsEpq rs + pqrstuEpqrstu T • • • 


(4) 


where ... indicates additional higher-body interactions. H is related to H by the canonical trans¬ 
formation operator e A , using the Baker-Campbell-Hausdorff (BCH) expansion, 


R = e A He 


= H + [H,A] + -[[H,A],A} + ... 


(5) 


where A is the antihermitian excitation and de-excitation operator between the target and external 
space. A can be written in terms of 1-body, 2-body, and higher components, 


A —A\ +A 2 + ... 


— (Exp Epx ) + . bqxy {^xypq Epqxy) + ■ ■ ■ 


( 6 ) 


px 


pqxy 
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The amplitudes of A: t px , t pqxy , etc., are chosen to make all matrix elements between the target 
and external spaces vanish. This leads to equations of the form of generalized Brillouin condi- 


Hons,™ 


(S|[tf,(E,„|4>)=0, 

E xy „)}\<S>)=0,... (7) 

where |<f>) is any state in the target space. The complete Brillouin conditions define H uniquely, 
up to decoupled rotations within the target and external spaces separately. 

In the above equations we immediately observe the need for approximation. This arises in (i) 
truncating A in Eq. (??), (ii) handling the BCH expansion of H and the many-body interactions in 
Eqs. (??), (??), and (iii) solving the amplitude equations, Eq. (??). We now discuss approxima¬ 
tions in each of these areas. 

2.2 Approximate canonical transformations 

In defining approximate canonical transformations we are motivated by the prior experience of 
Yanai, Neuscamman, and Chan 21 222433 and Yanai and Shiozaki, 28 and by the accuracy require¬ 
ments in this work. Our goal here is qualitative accuracy for a broad range of phenomena, rather 
than highly quantitative (e.g. 1 kcal/mol) chemical accuracy for a particular single target state at a 
given geometry. 

For (i) and (ii), we re-use the main ideas in Yanai, Neuscamman, and Chan, namely we truncate 
amplitudes at the two-body level (A—A\ +A 2 ) and limit the effective Hamiltonian to two-body 
interactions by approximating higher-body terms. For qualitative accuracy, it is reasonable to 
truncate the BCH expansion through second-order perturbation theory as in the work of Yanai and 
Shiozaki. This gives H as 


HkH + [H,A] + ^[[F,A],A] 


( 8 ) 


6 






where F is a Fock operator (defined more precisely below). If F is defined using the Hartree-Fock 
density matrix, the expectation value of H with the Hartree-Fock determinant (<t>/j H is the 
Hylleraas second-order energy functional. 

Note that although accurate only through second-order, the effective Hamiltonian in Eq. (??) 
already involves three- body interactions, generated by the first and second commutators. However, 
approximating H by a two-body Hamiltonian is well supported by the success of semi-empirical 
and model Hamiltonians, which universally assume only two-body interactions. Decomposing 
three- and higher-body interactions into effective two-body interactions was already considered 
by Iwata and Freed in the effective valence Hamiltonian theory. 13 In the canonical transforma¬ 
tion theory of Yanai and Chan, this decomposition was systematized using the generalized normal 
ordering of Mukherjee and Kutzelnigg 34 (an extension of the density cumulant expansion to oper¬ 
ators). 35 37 This re-expresses the 3-body operators generated above, , by 

cfZ ~ 9( rf A <) - 36(rf A H A t) + A <0 + 24()f A y?/ u ) - 9 A j£) (9) 

where we drop explicit 3-body fluctuation operators and 3-body density cumulants. (The above 
term a^ is expressed in terms of spin-orbitals. The correct spin-summed expression is given in 
Ref.,® 8 but we use the spin-orbital decomposition in our implementation). Since both the cumu¬ 
lants and the expectation value of the 3-body fluctuation operators vanish for a determinant state, 
truncating the 3-body terms in Eq. (??) preserves the expectation value of H with any determinant, 
and thus the value of the standard Hylleraas functional. Denoting the truncated normal ordering 
approximation in Eq. (??) by the subscript (1,2), we obtain the approximate two-body H( i 2 ) as 

«(1,2) = H+ [//.A] (12) + i [[F,A],A] (1 2) (10) 

When not ambiguous, we drop the subscript (1,2) in the labelling of hfn 2 ), understanding that it 
is defined as above. 

Both the Fock matrix F and the normal ordering approximation in Eq. (??) require density ma- 
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trices, thus introducing state-specific information. To minimize the amount of initial state-specific 
computation with the bare Hamiltonian (in keeping with the perturb and diagonalize philosophy) 
we use only density matrices from the Hartree-Fock ground-state in the target basis unless other¬ 
wise specified. 

2.3 Approximate amplitudes 

We next discuss how to determine the amplitudes. In the original canonical transformation work 
of Yanai and Chan, the amplitude equations are solved with respect to an initial (multireference) 
state | v Fo), typically a ground-state complete active space wavefunction, using the normal-ordered 
approximate H and the Brillouin conditions in Eq. (??) with <t>) = | v Fo). This provides a route 
to add dynamic correlation to the strong correlations presumed captured by | v E / o). However, there 
are several reasons not to use this strategy here. First, solving the amplitude equations for a mul¬ 
tireference | v Fo) is expensive - similar in scaling to, but several times the cost of, a coupled cluster 
singles doubles calculation. This is overkill for the qualitative accuracy we target. More impor¬ 
tantly, there are fundamental numerical issues when solving the Brillouin conditions. The Jacobian 
of Eq. (??) has small eigenvalues, and without a cutoff of these eigenvalues hundreds of iterations 
of the equations may be required. These small Jacobian eigenvalues also appear in internally 
contracted multi-reference coupled cluster theory 39 42 as well as in the anti-Hermitian contracted 
Schrodinger equation 25 26 (mathematically equivalent to a Schrodinger picture formulation of the 
Brillouin conditions) leading to convergence difficulties in both these methods. 

Further, at a fundamental level, the small Jacobian eigenvalues arise because the amplitude 
equations are solved for a single reference state | v E / o), which may have no weight in certain orbitals 
or electron configurations in the target space. Thus the amplitudes obtained are biased towards the 
reference state. This is an advantage for high accuracy for a given state (as in the original canonical 
transformation theory, or as in coupled cluster theory) but is a liability for a qualitative effective 
Hamiltonian intended to describe many states on an equal footing. 

For both these reasons, here we construct approximate amplitudes in a manner that does not 
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require the iterative solution of ill-conditioned and state-specific equations. We start with an effec¬ 
tive Hamiltonian defined by the Hylleraas expression in Eq. (??), and use a Fock operator F built 
from the Hartree-Fock density matrix in the small target basis. Re-expressing this in the parent 
basis, F can be written in semi-canonical form, for computational efficiency, as 


FlA 0 
0 F ab 5 a b 

Fxy&x y 


( 11 ) 


where i, j and a, b denote occupied and virtual molecular orbitals of the Fock operator in the target 
basis, respectively. The solution of the Brillouin condition with <f>) = O/)) (where 0/j is the 
target basis Hartree-Fock Slater determinant) is the Moller-Plesset first-order wavefunction, with 
amplitudes 


tlx - 


£i t' v 


tijxy ~ 


V 


ijxy 


£i + £j 


( 12 ) 


where £, = Fa, e x = F xx , respectively. We can now use these amplitudes to construct H. The 
singles and doubles amplitudes respectively add the Hartree-Fock basis set correction from the 
external space, and the correlated MP2 contribution from the external space, to the zeroth order 
Hartree-Fock energy of H. 

Note, however, that the amplitudes in Eq. (??) are defined only between orbitals occupied in 
|d>£>) i,j, and the external orbitals x.y. To decouple other low-lying states in the target basis from 
the external space, we consider using other (non-ground-state) determinants |d>^) to define the 
amplitudes in Eq. (??), which occupy “active virtual” orbitals a,b,c (orbitals not occupied in the 
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lowest determinant). This defines additional amplitudes involving the active virtuals, such as 


tax — 


tabxy 


£a £x 

Vabxy 


£(i + £b ~ £\ — £ 


-y 


( 13 ) 


and similar amplitudes with mixed (active and occupied) indices, such as t*J. Such amplitudes 
involving “active virtuals” are omitted in the definition of the effective Hamiltonian in equation- 
of-motion coupled cluster theory, because they annihilate the lowest Hartree-Fock ground-state 
|T>d) and do not change the ground-state energy. However, they are necessary to remove the 
bias towards the ground-state in the resulting effective Hamiltonian. Unfortunately, if we use the 
additional amplitudes as naively written in Eq. (??), we obtain very poor results. This is because 
the virtual eigenvalues £ a , £b appearing in Eq. (??) are determined for the ground-state density 
matrix and Fock operator, rather than for the Fock operator corresponding to the excited state 
determinant As the eigenvalue difference between the active virtual orbitals and external 

orbitals often vanishes, Eq. (??) can even yield divergent amplitudes. 

Physically, the energy of an electron in one of the active virtual orbitals (j) a is not well approxi¬ 
mated by e a , but is rather much closer to the HOMO energy level. This is because in Hartree-Fock 
theory the virtual energy levels are optimized in the field of N rather than N — 1 electrons. The 
appropriate orbital relaxation effect would be properly included if we retained three-body opera¬ 
tors in the effective Hamiltonian (similar to treating triples excitations in coupled cluster theory) 
but are not properly captured in the (1,2) approximation. To partially take into account this 3- 
body effect we now introduce a simple approximation. In the definition of amplitudes involving 
an active virtual orbital, we replace the corresponding virtual Fock energy £ a by a single modified 
orbital energy, £ a . In the simplest case, we replace £ cl by the HOMO energy, but we can also view 
£ a as an adjustable parameter. (This modification of the active virtual denominators can also be 
justified from the viewpoint of degenerate perturbation theory, as argued by Iwata and Freed, who 
placed all occupied and active (valence) orbitals at the same average energy thus creating a truly 
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degenerate zeroth order Hamiltonian^ 1 ®). Thus, 


tax — 




L ajxy 


tabxy 


Vajxy 


£ x T £y £« £/ 

Vabxy 

£x T £y 2£ a 


( 14 ) 


With this regularization, we define A to include all amplitudes involving the additional active vir¬ 
tual labels, except for active-active to active-external ( abcx ). Including the latter class of ampli¬ 
tudes tends to lead to significantly worse results, presumably because it requires a more rigorous 
treatment of the higher body effects than this simple effective denominator. 


2.4 Comparison to the approach of Yanai and Shiozaki 

In their work on canonical transcorrelated Hamiltonians, Yanai and Shiozaki used a simple defi¬ 
nition of the amplitudes that also does not require the solution of amplitude equations, and which 
is appropriate when both the target and parent basis are large. Following Ten-no, 43 the first-order 
MP2 amplitudes in an infinite external basis are fixed by the cusp condition. Using an FI2 fac¬ 
tor / = cxp( — yr 12 ) to represent the beyond linear terms in the r \2 dependence, one obtains MP2 
amplitudes given by 


W™) = \(umnFu\ij) + \{amnFu\ji) (15) 

where a,/3 label the formal infinite external parent basis, and Q\i is the strong orthogonality 
projector. 28 

In this work, we replace the formal infinite basis by the large parent basis and view the formula 
(??) as a way to provide the corresponding amplitudes t ix , t ax . li jxy etc. We can include orbital 
relaxation effects by defining the singles as in Eq. (??). <2i2 ensures that the F12 excitations are 


11 





orthogonal to those in the target basis since 


012 = (1 — /?(!))(! -Pity) ~x{\)x{2) 


(16) 


and p{ 1) denotes a (one-particle) projector onto the orbitals (occupied and active virtual) in the 
target space, and jc( 1), to the external orbitals. 

Using these F12-derived amplitudes, we then construct H directly using the BCH expansion 
in Eq. (??). Alternatively, Yanai and Shiozaki used approximation “C” of Kedzuch el al . 44 to 
compute the double commutator [[F,A],A]: we denote the corresponding approximation, F12(C). 
The effective Hamiltonians computed in either approach are identical in the limit of a large parent 
basis, but for rapid convergence, one should choose the parent basis to be an auxiliary basis set 
specifically constructed for use in FI2 theories. 

Note that Eq. (??) defines not only the standard occupied to external amplitudes tfj, but also 
active virtual to external excitations t x J b as well. Thus using the complete set of FI2 amplitudes (as 
in the earlier canonical transcorrelated Hamiltonian of Yanai and Shiozaki) also dresses the Hamil¬ 
tonian for excited states. (Note that, following Yanai and Shiozaki, we do not include active-active 
to active-external semi-internal excitations). However, the amplitudes target only the universal part 
of the short-range correlations, as defined by the Coulomb cusp, and thus retain no state-specific 
information. Such state information is only represented by amplitudes in the target space. For this 
reason, we expect effective Hamiltonians derived using the universal FI2 amplitudes to be less 


accurate than those obtained using the orbital MP2 coefficients in section 2.3 when folding into 
small target bases, such as a minimal basis. 
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3 Results and discussion 


3.1 Effective minimal basis Hamiltonian energies 


As a first check of our effective Hamiitonian construction, we compute second order perturbation 
theory (MP2) ground-state energies in the parent basis, using the bare Hamiitonian H, and the cor¬ 
responding MP2 ground-state energies in the target basis, using the effective Hamiitonian defined 
in Eq. (??). For comparison, we also compute the MP2 ground-state energies in the target basis 
with the bare Hamiltonian, to show the effects of folding. To construct H we use both explicit 


orbital amplitudes (Sec. 2.3) as well as FI2 amplitudes (Sec. 2.4). We recall that despite the (1,2) 


approximation in the commutator expansion for H, the MP2 energies (using explicit orbital am¬ 
plitudes) in the parent and target basis should match exactly, up to orbital relaxation effects. The 
orbital relaxation effects are not captured completely simply because they are treated to infinite 
order in the Hartree-Fock calculation in the parent basis, but only to second order in our effective 
Hamiltonian. 

Tables |T[ |2] give the errors in the effective Hamiltonian MP2 energies relative to the parent 
basis for the water and nitrogen molecules. We use the single-^ contraction of Roos’ ANO family 
of bases sets, labeled ANO-RCC-MIN, and Dunning’s family of cc-pVXZ (X = D, T), labeled 
DZ and TZ target bases, and an aQZ parent basis (aXZ is Dunning’s aug-cc-pVXZ basis 4 S). The 
quoted reference MP2 aQZ energy does not include the additional basis functions from the target 
basis, but this has a negligible effect relative to the errors that we are discussing; for example, the 
MP2 water energy using the aQZ basis is -76.38278 £),, while using the union of the MIN and aQZ 
basis, it is -76.38405 Eh. 

For H constructed from orbital amplitudes, the error in the effective ground-state energy, due 
to incomplete orbital relaxation, is small: [24] mE), and [37] m£;, for water and nitrogen respec¬ 
tively, even in the smallest ANO-RCC-MIN target basis. This compares quite favourably with the 
error in the bare Hamiltonian ANO-RCC-MIN MP2 energy: 412 mE), and 529 mE), for water and 
nitrogen respectively. The error from incomplete orbital relaxation decreases as the target basis 
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size increases. 


Table 1: Difference between the MP2 ground-state energy in various target bases (ANO-RCC- 
MIN, DZ, TZ), and the MP2 aQZ (parent basis) ground-state energy, for the water molecule at 
the equilibrium geometry R = 1.80847 A, 9 = 104.5°. The H column denotes the (standard) MP2 
calculation with the normal bare Hamiltonian. The other columns refer to the calculations using 
the effective Hamiltonian. Orb. denotes H using orbital amplitudes, F12 denotes F12 amplitudes, 
FI2(C) denotes using approximation “C” in the double commutator, and F12+Ai denotes addi¬ 
tional singles excitations. All energies in E The MP2 aQZ energy is —76.38278 £/,. (The union 
of ANO-RCC-MIN and aQZ MP2 energy is -76.38405 E h ). 


Target Basis 

Error (H) 

Orb. 

Error (H) 

F12 F12(C) 

F12+Ai 

ANO-RCC-MIN 

0.4129 

0.0247 

0.2374 

0.1933 

0.1203 

DZ 

0.1520 

0.0330 

0.0760 

0.0930 

0.0412 

TZ 

0.0505 

0.0095 

0.0197 

-0.0337 

0.0119 


Table 2: Difference between the MP2 ground-state energy in various target bases (ANO-RCC- 
MIN, DZ, TZ), and the MP2 aQZ (parent basis) ground-state energy, for the nitrogen molecule at 
the equilibrium geometry R = 1.09768A. The H column denotes the (standard) MP2 calculation 
with the normal bare Hamiltonian. The other columns refer to the calculations using the effective 
Hamiltonian. Orb. denotes H using orbital amplitudes, F12 denotes F12 amplitudes, F12(C) 
denotes using approximation “C” in the double commutator, and F12+Ai denotes additional singles 
excitations. All energies in E /,. The MP2 aQZ energy is —109.45124 £/,. (The union of ANO- 
RCC-MIN and aQZ MP2 energy is -109.45259 E h ). 


Target Basis 

Error (H) 

Orb. 

Error ( H ) 

F12 F12(C) 

F12+Ai 

ANO-RCC-MIN 

0.5291 

0.0374 

0.3396 

0.2531 

0.1249 

DZ 

0.1865 

0.0324 

0.0787 

0.0157 

0.0453 

TZ 

0.0683 

0.0075 

0.0184 

-0.0364 

0.0107 


For H constructed from the F12 amplitudes, using an aQZ CABS basis, the errors are some¬ 
what larger. (These errors are measured relative to the MP2 aQZ energy. We could measure the 
error relative to the MP2-F12 aQZ energy, but as the difference between the MP2 and MP2-F12 
aQZ energies is small on the scale of errors we are discussing (1.3 m£/ ; for the water molecule) 
this would not change the conclusions. A measure of the CABS basis completeness is given by 
the difference between the F12 and F12(C) columns.) We observe several important things. First, 
the difference between the F12 and F12+Ai columns, shows that the CABS single particle ampli- 



tudes are very important, as they capture orbital relaxation. Second, the FI2 amplitudes lead to 
a significantly less accurate H than the explicit MP2 orbital amplitudes. This is because the F12 
amplitudes do not capture the non-universal part of the short-range correlation. 

3.2 Excitation energies 

The purpose of the effective Hamiltonian, is, of course, not simply to reproduce the ground-state 
calculation from which it is constructed, but to be able to use it in new calculations. We therefore 
now examine the accuracy of the effective Hamiltonians for the excitation energies of water, ni¬ 
trogen, and ethylene. Density matrix renormalization group (DMRG) was used for the water and 
nitrogen effective Hamiltonian and parent bases excited state calculations. The DMRG calcula¬ 
tions used up to M=4000 (with 5 2 symmetry) for both molecules and all states, and are converged 
to microHartree level (l.v electrons in nitrogen were kept frozen for all calculations). Ethylene 
effective Hamiltonian and parent bases excitation energies were computed using the equation of 
motion coupled cluster with connected single and double excitations and a perturbative treatment 
of triples (EOM-CCSD(T)). 

Tables [3] and [4] give the lowest few excitation energies for the water and nitrogen molecules 
using the aforementioned ANO-RCC-MIN target basis, and Dunning’s DZ parent basis. We first 
examine the water molecule. The excitation energies using the bare ANO-RCC-MIN Hamiltonian 
are very poor, with a maximum error of 5.40 eV. The effective Hamiltonian using all orbital am¬ 
plitudes (shifting the active virtual energies to £ a — 0.2 £),) yields a very significant improvement 
with a maximum error now of only 0.45 eV, quite surprising for a calculation in a minimal basis! 
For comparison, if we use FI2 amplitudes the errors of the effective Hamiltonian are essentially 
unchanged with respect to the bare Hamiltonian H, with a maximum error of -5.43 eV. This shows 
that the FI2 amplitudes do not properly capture differential correlation between the ground and 
excited state in this molecule. Some part of the differential correlation is recovered using the addi¬ 
tional A\ amplitudes (F12+Ai), reducing the maximum error to 0.91 eV, but this is still worse than 
the effective Hamiltonian using explicit orbital amplitudes. 
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An important column in the table is the one labelled “no act.”. These show results where “ac¬ 
tive” orbitals are omitted from the amplitudes in the H construction. This has no effect on the 
ground-state energies in the previous section, but as we argued will significantly affect the ex¬ 
citations. We see that this effective Hamiltonian - which is in essence the Hamiltonian used in 
equation-of-motion coupled cluster theories - yields even worse excitation energies than the bare 
Hamiltonian, since it overcorrelates the ground-state and leads to unbalanced treatment of excita¬ 
tions. In equation-of-motion coupled cluster, this imbalance is ameliorated by rediagonalizing in 
the full orbital space (including the external orbitals), not just a small “active” space as is implicitly 
done here by diagonalizing in the minimal basis. 

Column “[[//,A],A]” shows the results of using the full Hamiltonian operator, rather than the 
Fock operator, in the double commutator contribution to H in Eq. (??). Interestingly we find that 
the results using the Fock operator are uniformly better than using the true H. We attribute this to 
some form of error cancellation. 

Table 3: Errors of effective Hamiltonian excitation energies compared to parent basis excitation 
energies of water (computed using DMRG with parameters mentioned in the text). Target (small) 
basis is ANO-RCC-MIN and parent basis is cc-pVDZ. Units are in electron Volts. State notation 
X.Y.Z denotes Multiplicity.Irrep.State (with State being energy ordered). 


State 

Ref. 

DZ 

Bare H 

ANO-RCC-MIN 



H 




Orb. a 

[H,A]Ar 

no act. 

F12 

F12(C) 

F12+Ai 

3.3.1 

7.46 

3.49 

-0.36 

1.34 

6.48 

3.93 

3.88 

-0.91 

1.3.1 

8.13 

4.24 

-0.09 

1.36 

7.23 

4.64 

4.47 

-0.69 

3.2.1 

9.73 

4.56 

-0.08 

1.93 

7.20 

4.85 

4.82 

-0.67 

3.1.1 

9.89 

3.24 

-0.38 

6.86 

6.31 

3.48 

3.40 

-0.86 

1.2.1 

10.15 

5.04 

0.03 

1.92 

7.62 

5.27 

5.14 

-0.55 

1.1.2 

10.80 

4.43 

-0.23 

1.07 

7.66 

4.48 

4.30 

-0.44 

3.4.1 

11.90 

3.59 

-0.28 

1.57 

6.16 

3.83 

3.74 

-0.85 

1.4.1 

12.86 

5.40 

0.45 

2.14 

8.21 

5.43 

5.21 

0.11 


£ a set to —0.20 a.u. 


The excitation energies for nitrogen in Table [4] follow a similar trend to those seen for water. 
The main difference is that the excitation energies from the bare Hamiltonian in the target ANO- 
RCC-MIN basis are not too far from those in the parent DZ basis (maximum error of -1.1 eV). 
We thus expect folding to an effective minimal basis Hamiltonian to yield less of an improvement, 
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Table 4: Errors of effective Hamiltonian excitation energies compared to parent basis excitation 
energies of water (computed using DMRG with parameters mentioned in the text). Target (small) 
basis is ANO-RCC-MIN and parent basis is cc-pVDZ. Units are in electron Volts. State notation 
X.Y.Z denotes Multiplicity.Irrep.State (with State being energy ordered). 


State 

DZ 

ANO-RCC-MIN 



H 




Orb. a 

[tf,A],A] a 

no act. 

F12 

F12(C) 

F12+Ai 

3.6.1 

7.83 

0.57 

0.28 

0.64 

1.90 

0.77 

0.67 

0.33 

3.3.1 

8.12 

-0.56 

0.68 

0.84 

2.58 

-0.47 

-0.57 

0.56 

3.5.1 

9.13 

0.85 

0.42 

0.60 

2.18 

0.95 

0.75 

0.46 

1.3.1 

9.54 

-0.66 

0.77 

0.93 

2.72 

-0.50 

-0.79 

0.50 

3.5.2 

9.93 

0.71 

0.34 

0.48 

2.24 

0.76 

0.52 

0.29 

1.5.1 

10.26 

1.10 

0.54 

0.58 

2.47 

1.12 

0.84 

0.57 

1.6.1 

10.65 

1.05 

0.51 

0.47 

2.46 

1.04 

0.69 

0.47 


a e a set to —0.30 a.u. 


and indeed that is the case. Using the effective Hamiltonian with orbital amplitudes, the maximum 
error is reduced to -0.77 eV. The relative performance of the FI2 amplitudes and orbital amplitudes 
follow a similar trend to what is seen in the water molecule: the F12 amplitudes alone lead to little 
or no improvement in the excitation energies. 

Figures [I] |2] [3] present correlation plots of the bare and effective Hamiltonian excitation ener¬ 
gies, for the low-lying excitation energies of the water, nitrogen, and ethylene molecules. (These 
were generated as the 4 lowest energy states within each irrep, for each multiplicity). We show 
results for the bare Hamiltonian in DZ and TZ bases, and the effective Hamiltonian in a ANO- 
RCC-MIN basis. Statistical data for the excitation energies of water and nitrogen are also given in 
Tables [3] [6} 

All plots clearly show the very substantial improvement brought about by using the effective 
Hamiltonian, rather than the bare Hamiltonian. The excitation energies (points) for the bare target 
basis Hamiltonian are quite far from the exact (parent basis) energies. In the case of water and 
ethylene they are too high, while for nitrogen they are scattered around the parent basis results. 
Folding yields effective Hamiltonians with excitation energies tightly clustered around the parent 
results. 

The figures also show the influence of the choice of energies for the active virtuals. As we 
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saw for the first few excited states above, using no active virtuals in the effective Hamiltonian 
construction leads to very poor excitation energies, worse than using the bare Hamiltonian alone. 
Using active virtuals and shifting them to the HOMO energy greatly improves the results, and 
choosing an optimal energy of the active virtuals improves them further. This is also seen through 
the statistical tables: for example, for water in the DZ basis, using the effective minimal basis 
Hamiltonian reduces the RMSD error from 6.72 eV (bare Hamiltonian) to 1.79 (£ a = HOMO 
energy) and 1.05 eV (£ a — —0.3 eV). For nitrogen, as discussed earlier, the minimal basis excitation 
energies are reasonably accurate, and we observe a less dramatic reduction of the RMSD error, 
from 1.24 eV to 1.01 eV (£ a — —0.3 eV). 

In explicitly correlated theory, it is common to refer to the effect of including an explicit FI2 
correlation factor as increasing the “zeta” level of the calculation, for example, turning a DZ 
ground-state quality calculation into a QZ quality calculation. In a similar way, we ask, if we 
use a very large parent basis, how much does that increase the level of the effective Hamilto¬ 
nian target basis calculation? In Fig.[l]we further compare the errors of excitations (measured from 
TZ basis excitations) with those from an effective ANO-RCC-MIN Hamiltonian derived from the 
TZ parent basis, and an explicit (bare Hamiltonian) DZ calculation. We find that the canonical 
transformation approximately achieves one additional £ level of quality. 

Table 5: Statistical information for correlation plots of the excitation energies of water. R 2 and 
RMSD values. 


Method 

R 2 

RMSD (eY) 

Average deviations (eV) 
Signed Unsigned 

Parent=DZ 

Bare H ANO-RCC-MIN 

0.77 

6.72 

-6.31 

6.31 

H ANO-RCC-MIN no active 

0.65 

10.36 

-9.92 

9.92 

H ANO-RCC-MIN £ a =HOMO 

0.97 

1.79 

-1.34 

1.35 

H ANO-RCC-MIN e a = -0.30 

0.99 

1.05 

-0.07 

0.75 

Parent=TZ 

Bare H ANO-RCC-MIN 

0.68 

7.43 

-6.53 

6.53 

H ANO-RCC-MIN no active 

0.55 

12.12 

-11.17 

11.17 

H ANO-RCC-MIN £ a =HOMO 

0.88 

3.56 

-2.37 

2.46 

H ANO-RCC-MIN £ a = -0.30 

0.95 

2.15 

-0.02 

1.77 
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Figure 1: Correlation plots for excitation energies of the water molecule with the target ANO- 
RCC-MIN effective Hamiltonian, compared to the excitation energies in the parent basis (cc-pVDZ 
(left), cc-pVTZ (right)). The bottom chart compares the excitation energies of a target ANO-RCC- 
MIN effective Hamiltonian (parent cc-pVTZ basis) and excitation energies of a bare cc-pVDZ 
Hamiltonian with the parent cc-pVTZ excitation energies, to see the level of improvement due 
to canonical transformation. (Excitation energies computed using DMRG). 

3.3 Potential energy curves 

As a further test of our effective Hamiltonians, we now study how they perform in describing 
potential energy curves in a minimal (or small) basis. We first consider the nitrogen dimer. In Fig¬ 
ure [4] we show the errors in the ground, first singlet excited, and first triplet excited state potential 
energy curves computed by diagonalizing the effective Hamiltonian in the ANO-RCC-MIN basis, 
as compared to the parent basis. (The effective Hamiltonian is derived for the singlet ground- 
state). For comparison we also show errors of the curves computed using the bare Hamiltonian in 
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Figure 2: Correlation plots for excitation energies of the nitrogen molecule with the target ANO- 
RCC-MIN effective Hamiltonian, compared to the excitation energies in the parent basis (cc-pVDZ 
(left), cc-pVTZ (right)). (Excitation energies computed using DMRG). 



Figure 3: Correlation plots for excitation energies of the ethylene molecule with the target ANO- 
RCC-MIN effective Hamiltonian, compared to the excitation energies in the parent cc-pVDZ basis. 
(Excitation energies computed using EOM-CCSD(T)). 


the ANO-RCC-MIN basis. For each basis, the curves were computed using full valence complete 
active space self-consistent field followed by a canonical transformation with singles and doubles 
excitations dynamic correlation treatment (CASSCF+CTSD). 

As expected, the bare Hamiltonian potential curves in the minimal basis display very large 
errors in the compressed bond region, due to the lack of a second shell to describe polarizing 
effects. In contrast, the effective Hamiltonian yields much smaller errors across the curve (roughly 
half the error at long bond-lengths, and much higher accuracy at short bond lengths). But more 
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Figure 4: Parallelity errors in the ground, first singlet excited, and first triplet excited state us¬ 
ing the bare Hamiltonian H and the effective Hamiltonian H. The curves are generated using 
CASSCF(6,6)+CTSD. Top: parent basis=cc-pVDZ, left: target basis=ANO-RCC-MIN, right: tar¬ 
get basis=ANO-RCC-MIN, Bottom: parent basis=cc-pVTZ, target basis=cc-pVDZ. The reference 
basis for each figure is the parent basis. 
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Table 6: Statistical information for correlation plots of the excitation energies of nitrogen, R 2 and 
RMSD values. 


Method 

R 1 

RMSD (eV) 

Average deviations (eV) 
Signed Unsigned 

Parent=DZ 

Bare H ANO-RCC-MIN 

0.97 

1.24 

-0.61 

0.89 

H ANO-RCC-MIN no active 

0.84 

4.13 

-3.85 

3.85 

H ANO-RCC-MIN e a =HOMO 

0.97 

1.52 

-1.17 

1.17 

H ANO-RCC-MIN e a = -0.30 

0.97 

1.01 

-0.94 

.0.94 

Parent=TZ 

Bare H ANO-RCC-MIN 

0.94 

1.31 

-0.54 

0.99 

H ANO-RCC-MIN no active 

0.80 

4.43 

-4.25 

4.25 

H ANO-RCC-MIN £ a =HOMO 

0.98 

0.99 

-0.80 

0.80 

H ANO-RCC-MIN e a = -0.30 

0.98 

0.95 

-0.75 

0.77 


importantly, the errors are very parallel across the entire range of bond-lengths, and consistent 
between all the states. 

As a more challenging probe of the quality of the chemistry that can be described by two- 
particle Hamiltonians in a minimal basis, we now consider the chromium dimer binding curve. The 
chromium dimer has been the subject of numerous quantum chemical studies due to the difficulties 
in obtaining a potential curve of even qualitatively correct shape. 46 49 It is well known that, in 
addition to the strong correlation arising from the spin coupling of the chromium d electrons, 
very large basis sets are needed. 49 50 Thus constructing a two-particle effective Hamiltonian for a 
minimal basis description is a serious challenge. 

As a parent basis, we used high quality ANO-RCC bases of double-^, triple-^, quadruple-^ 
quality^ 0 supplemented with an additional set of ^/-functions taken from the next £ in the series. 

This yields the following basis set labels and structure: ANO-RCC-DZP+J (21sl5pl0d6f/5s3p3dlf), 
ANO-RCC-TZP+d (21 s 15p 10d6f4g/6s4p4d2f 1 g), and ANO-RCC-QZP+d (21sl5pl0d6f4g2h/7s5p5d3f2glh). 
We used the valence (12,12) active space in the CASSCF calculation. We first tried folding to a 
target ANO-RCC-MIN (21sl5pl0d/4s2pld) basis. While this yielded a bound potential, the bind¬ 
ing energy was 111.4 kcal mok 1 (4.83 eV), several times too large, showing that, at least using 
our procedure, a reasonable effective two-particle Hamiltonian in a strict minimal basis cannot be 
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constructed. We next tried folding to a slightly larger ANO-RCC-MIN +d basis, where an addi¬ 
tional d shell (taken from the double-£ basis (21sl5pl0d/4s2p2d)) was included in the target basis 
to capture polarization effects. The CASSCF density matrix was used in the (1,2) construction of 
H as the Hartree-Fock potential energy surface has an inconvenient curve crossing. 

The experimental curve is shown in Fig [5} The bare Hamiltonian A N O- RCC - MIN +d basis 
results are shown using CASSCF(12,12) and CASSCF(12,12) plus CTSD. Neither of these curves 
show any binding, as expected in a minimal basis. We see, however, that the CTSD calculations 
with the effective Hamiltonian in the ANO-RCC-MIN+c/ basis yield a nicely bound potential en¬ 
ergy curve of a very similar shape and depth to experiment! 

The values for the binding energy, equilibrium distance, and spectroscopic constants, obtained 
using the aforementioned constructed ANO-RCC parent bases, DZP +d, TZP +cl, QZP +d, and the 
target ANO-RCC-MIN+c/ basis, are given in Table [7J Even using the ANO-RCC-DZP+c/ external 
basis we recover binding in the effective ANO-RCC-MIN+c/ Hamiltonian. Folding down from 
the largest QZ parent basis, we obtain for our effective ANO-RCC-MIN+c/ Hamiltonian an R e<? of 
1.83 A and D e of 1.70 eV. This compares quite favorably with experiment. Overall, this shows that 
it is possible to construct an effective Hamiltonian to describe even this very difficult case of the 
binding of the chromium dimer, so long as the minimal basis is slightly expanded. We understand 
this because the minimal basis for Cr 2 with a (12,12) active space does not leave any virtual orbitals 
to be used in a correlated calculation after the folding procedure. In this strongly correlated system, 
the role of the extra set of d-functions is to give 10 virtual orbitals outside the active space that help 
to relax the orbitals. 


4 Conclusions 

In this work we asked whether a simple canonical transformation, using a single-reference-like 
modified second-order perturbation theory formula, yields an effective Hamiltonian in a minimal 
(or very small) basis with qualitatively correct chemistry. As we saw, the answer is in the affirma- 
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Figure 5: Potential energy curves for the chromium dimer for the different methods shown using 
the ANO-RCC-MIN+J basis set. External basis set is ANO-RCC-QZP+r/. 

Table 7: Equilibrium bond distances and dissociation energies for the chromium dimer using an 
effective Hamiltonian in a minimal + d basis, folded down from increasingly large basis sets (DZP, 
TZP, QZP), and for different values of the active virtual energy £ a . 


— CASSCF (H) 
-CT (H) 

— CT (H) 

-o- Experiment 



Basis 

R eq (A) 

De (eV) 

MIN +d (DZP +d) 

£ a =0.00 

2.20 

1.55 

e a =0.05 

2.10 

1.66 

e a =0 .10 

1.90 

1.84 

MlN+d (TZP +d) 

£ fl =0.00 

2.40 

0.75 

e a =0.04 

2.40 

0.78 

£ a =0.05 

2.40 

0.79 

MIN +d (QZP +d) 

£ a =0.00 

1.83 

1.70 

Experiment 

1.679° 

1.47* 

1.45° 

1.56 rf 


a RefP 
b Ref.EH 
c RefP 

d RefP 
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tive: such minimal basis set effective Hamiltonians give qualitatively correct excitation energies 
and binding curves in water, nitrogen, ethylene, and even the chromium dimer! 

Effective Hamiltonians formally provide a conceptual link between quantitative and qualita¬ 
tive reasoning. The simple nature of the construction here means that we can now derive accurate 
effective Hamiltonians for complex systems in practice, including systems with transition metals, 
and for correlated electrons in the condensed phase, where models are essential not only for inter¬ 
pretation but for practical computation. Intriguingly, our various calculations suggests that these 
simple effective Hamiltonians may even be quantitatively accurate. The technique here thus fur¬ 
ther provides the possibility of very low cost (that is lower than multireference perturbation theory) 
treatment of dynamic correlation in challenging multireference quantum chemistry. 
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